Privacy Server for Protecting Personally Identifiable Information

ABSTRACT

A privacy server protects private information by substituting a token or an identifier for the private information. The privacy server recognizes that a communication includes private information and intercepts the communication. The privacy server replaces the private information with a random or pseudo-random token or identifier. The privacy server maintains the private information in a local database and associates the private information for a particular person with the token or identifier for that person.

FIELD OF THE INVENTION

The invention generally relates to the protection of personally identifiable information and more particularly to the substitution of a token or identifier for the personally identifiable information by a privacy server.

BACKGROUND

The protection of personally identifiable information (PII) is of concern as more and more information is stored and shared electronically. There are a number of laws that govern how PII can be used and how it must be protected. For example, some Canadian provinces have enacted laws to address how private electronic information, such as PII, collected by public Canadian institutions can be handled. These laws require that private data not cross provincial or Canadian borders, or be accessed by citizens or authorities of other countries. These types of laws may limit the ability of provincial residents to use applications or services that are hosted outside of the province if the application requests PII. It is not uncommon for the server that hosts an application or service to be located in a different jurisdiction from the user of the application or service. For example, a provider of a learning management system (LMS) may host the software for the LMS on a server in one jurisdiction, but serve students in a number of other jurisdictions. However, the LMS may not be able to serve students of an institution located in a jurisdiction that prohibits the transmission of the student's PII outside the jurisdiction when the LMS is located outside the jurisdiction.

To accommodate laws that prohibit the transmission of PII outside the jurisdiction or otherwise require special handling of PII, a provider can choose to host its application or service within that jurisdiction or to provide specific applications or services to address the special handling required by that jurisdiction. However, these approaches increase cost and complexity for the provider. Alternatively, a user or institution can choose to limit themselves to only those applications and services that are hosted within their jurisdiction or that provide the special handling of PII, but in doing so may deprive themselves of access to the best available resources.

Since users want access to the best available applications and solutions regardless of where they may be hosted, there is a need for a solution that protects PII without requiring separate hosting or special handling for different jurisdictions.

SUMMARY

Aspects of the invention relate to a privacy server and methods of operating the privacy server to protect private information. A privacy server interfaces with a user's computer system and another server, such as an application server. The privacy server protects private information received from the user's computer system by sending a token or identifier to the application server instead of the private information.

The privacy server recognizes when the user is communicating private information to the application and intercepts the communication. The privacy server replaces the private information with a random or pseudo-random token or identifier. The privacy server maintains the private information in a local database and associates the private information for a particular person with a token or identifier for that person. Communications from the application to the user also pass through the privacy server. If a communication includes the token or identifier, then the privacy server intercepts the communication and replaces the token with the user's private information prior to presenting the communication to the user.

Other features, advantages, and objects of the present invention will be apparent to those skilled in the art with reference to the remaining text and drawings of this application.

BRIEF DESCRIPTION OF THE FIGURES

These and other features, aspects, and advantages of the present disclosure are better understood when the following Detailed Description is read with reference to the accompanying drawings, where:

FIG. 1 is a block diagram illustrating an exemplary privacy server.

FIG. 2 is a block diagram illustrating an exemplary registration process using a privacy server.

FIG. 3 is a block diagram illustrating another exemplary registration process using a privacy server.

FIG. 4 is a block diagram illustrating yet another exemplary registration process using a privacy server.

FIG. 5 is a flow diagram illustrating an exemplary process for creating a token.

FIG. 6 is a block diagram illustrating an exemplary web page process using a privacy server.

FIG. 7 is a block diagram illustrating another exemplary web page process using a privacy server.

FIG. 8 is a block diagram illustrating an exemplary e-mail process using a privacy server.

FIG. 9 is a block diagram illustrating another exemplary e-mail process using a privacy server.

DETAILED DESCRIPTION

Aspects of the present invention are directed to a privacy server that maintains private information at the privacy server while using applications or services hosted on other servers. In some instances, the privacy server can be used to maintain private data within one jurisdiction even though a user is accessing an application hosted on a server located outside the jurisdiction. The privacy server intercepts communications between a user and an application that include private information. The privacy server creates a token or identifier that does not disclose the private information and uses the token or identifier instead of the private information when communicating with other servers and systems. In some instances, the operation of the privacy server in substituting a token or identifier for private information may be transparent to both the user and to the other servers and systems.

The scope of private information may vary, but generally includes any information unique to an individual, such as name, home address, opinions, educational records, age, gender, income, medical records, and/or financial data. The terms private information and personally identifiable information (PII) are used interchangeably herein. Information that is not private, i.e., not identified or linked to an individual is referred to herein as anonymous.

Exemplary Operating Environment

FIG. 1 illustrates an exemplary relationship between the user's computer system 102, the privacy server 104, and the application server 106. Both the user's computer system 102 and the privacy server 104 are located in the same jurisdiction. The application server 106 is located in a different jurisdiction. FIG. 1 illustrates that the user's computer system 102 and the privacy server 104 are located in Canada and that the application server 106 is located in the United States. Although FIG. 1 illustrates that the jurisdictions are based on country boundaries, other types of jurisdictional boundaries may be used. For example, if a state or a province has privacy requirements that are more stringent than the applicable national privacy requirements, then the user's computer system and the privacy server may be in one state or province and the application server may be in a different state or province, but all may be located in the same country.

Although not shown in FIG. 1, the application server can be connected to additional privacy servers and/or connected directly to additional user systems. The additional privacy servers and/or user systems may be located in the same or additional jurisdictions. The privacy server may be configured to allow it to interact with an application or application server without the application or application server recognizing that it is interacting with the privacy server.

A user may access an application hosted on the application server, such as a cloud-based application, i.e., an application resident on the application server and accessible via a network, such as the Internet. The user's communications with the application pass through the privacy server. The privacy server recognizes when the user is communicating PII to the application and intercepts the communication. The privacy server replaces the PII with a random or pseudo-random token or identifier. The privacy server maintains the PII in the local PII database 105 and associates the PII for a particular user with an identifier for that user. An identifier, such as a PII identifier, maybe a random or pseudo-random string. The privacy server may decorate the PII identifier to create a token. The application hosted by the application server receives the token from the privacy server and uses it to identify a user. In some instances, the application does not distinguish between a received token and received PII, which may allow an existing application to work with users that access the application via a privacy server, as well as users that access the application directly.

The application may maintain the token in the database associated with the application server, illustrated by the LMS database in FIG. 1. If the application is capable of receiving both tokens and PII, then the tokens are maintained in a manner similar to that used to maintain PII. The LMS database is distinct from the PII database so that the application does not receive or use the user's PII.

Any communication from the application to the user also passes through the privacy server. If the communication includes a token, then the privacy server intercepts the communication and replaces the token with the user's PII prior to presenting the communication to the user. In this manner, the existence and operation of the privacy server is transparent to the user.

Although FIG. 1 illustrates one possible configuration, the features discussed herein are not limited to any particular hardware architecture or configuration. The user's computer system, the privacy server and the application server may include a computing device, as well as a non-transitory computer-readable medium capable of storing code and may be capable of the operations described herein. One example of a computing device is a multipurpose computer system capable of executing software or other code. Examples of non-transitory computer-readable medium include electronic, optical, magnetic, or other storage device capable of storing computer-readable instructions. Other examples include, but are not limited to, a floppy disk, CD-ROM, DVD, magnetic disk, memory chip, ROM, RAM, an ASIC, or any other medium from which a computer processor can read instructions. The user's computer system, the privacy server, and the application server may communicate via any type of a network including, but not limited to, a wired or wireless network.

Exemplary Operation

FIG. 1 illustrates that the privacy server may include modules to support a PII registration process 112, as well as a PII proxy process 110 and an e-mail relay process 114. The operation of the privacy server will now be described with reference to a registration process where the privacy server generates and uses a token. A non-limiting example of an educational application, such as a Learning Management System (LMS), that is capable of using student PII is used for illustration.

In this example, a teacher registers a student by entering the student's information, including PII, via a system, such as the user's computer system 102 of FIG. 2. The teacher may enter the information by uploading a spreadsheet that includes information for one or more students or may enter the information via a registration form or page. The privacy server recognizes that student information is being entered as part of the registration process and intercepts the information. In some instances, the privacy server recognizes that a spreadsheet has been uploaded that is associated with PII. In other instances, the privacy server recognizes that the teacher is accessing a registration form or page. In one exemplary implementation, the privacy server is programmed to recognize that a particular form or web page is being loaded and intercepts the PII entered into the form or web page prior to communicating it to the application server.

The teacher is unaware that the privacy server is intercepting any student information. The teacher interacts with the application in the same manner as the teacher would if there was no privacy server. As will be apparent to those skilled in the art, there are other ways that the privacy server may recognize PII, including, but not limited to being configured to recognize certain actions or sequences associated with a user's interaction with the application or to recognize certain types of information.

Once the privacy server intercepts the PII, the privacy server saves the PII locally and generates a PII identifier. In this example, the PII includes the student's name, John Smith, and the student's e-mail address, “john_smith@myschool.edu.ca”. The student's name and e-mail address are saved in the PII database and are associated with the PII identifier, which in this example is “12345”.

The PII identifier is a random or pseudo-random character string. The character string can be an alphabetic character string, a numeric character string, or an alpha-numeric character string. The PII is not used to generate the PII identifier. Instead other types of information, including, but not limited to the time of day or a portion of the network address of the user's computer system may be used to generate the PII identifier. The PII identifier uniquely identifies the student or other entity within the scope of the privacy server. The PII identifier may be generated by a computer-implemented method provided by the privacy server. One example of a PII identifier is a GUID or globally unique identifier. In some instances the PII identifier includes designated characters that can be used for sorting. For example, a PII identifier may include characters that represent the first few letters of the student's sur name to support sorting alphabetically by sur name. In this instance the PII identifier is not completely random, but still protects the student's PII.

As shown in FIG. 2, the privacy server 102 may use attributes or attribute codes to identify different types of PII. For example, the student's full name may be associated with an attribute for full name and may be identified by an attribute code of “FN”, the student's sur name may be may be associated with an attribute for sur name and identified by an attribute code of “SN”, and the student's given name may be associated with an attribute for given name and identified by an attribute code of “GN”. Attributes and attribute codes may be helpful when the PII identifier represents more than one type of information. The manner in which the student's PII is partitioned and the types of attributes and attribute codes are typically based upon the requirements of the application. If the application uses a student's given name, sur name, and full name, then the attributes and attribute codes may be set up as illustrated in FIG. 2.

FIG. 2 illustrates that the privacy server may create a token by decorating the PII identifier. The privacy server may decorate the token by adding a start code and/or an end code to the beginning and/or ending of the PII identifier to indicate the beginning and/or ending of the token. FIG. 2 illustrates a start code of “#@” and an end code of “@#”. However, any suitable start or end code may be used. In the alternative or in addition, the privacy server may decorate the token by adding an attribute code to the PII identifier to identify the type of PII. For example, FIG. 2 illustrates that the privacy server may add “FN” to indicate the student's full name, “SN” to indicate the student's sur name, and/or “GN” to indicate the student's given name.

The privacy server sends the token to the application server 106 instead of the student's PII. For example, if the teacher uploaded a spreadsheet with a student's full name, then the privacy server replaces the student's full name with a token and sends the spreadsheet with the token to the application server. In this example, the token may be “#@FN:12345@#”. Similarly, if the teacher enters student PII into a registration form, then the privacy server replaces the student's full name with a token before sending the registration form to the application server. Only the token is sent to the application server. The privacy server does not send the student's PII to the application server.

The application receives the spreadsheet or registration form from the privacy server and registers the student with the application by storing the token in the LMS database 107. The application treats the token as a student identifier. The token is maintained in the LMS database so that the student's performance and progress can be tracked. In some instances the application, receives tokens for some students and PII for other students. If so, then the application treats the tokens in the same manner that it treats student PII. One benefit of the privacy server may be that the application does not need to be changed to protect PII since the protection is provided by the privacy server.

In other instances, the application server only stores the PII identifier, not each token. One example of this is shown in FIG. 3 where the application server includes a data layer 302. The data layer receives a token, removes the decoration, i.e., removes any start/end codes and any attribute codes, and passes only the PII identifier to the database 107 for storage. The data layer also passes the e-mail domain of the privacy server to the database so that it is associated with the PII identifier. Prior to the application server communicating with the privacy server, the data layer may decorate the PII identifier so that the appropriate token is sent to the privacy server.

In yet another instance, the privacy server does not decorate the PII identifier. As illustrated in FIG. 4, the privacy server creates a PII identifier, “12345”, for a student and sends the PII identifier and the e-mail domain of the privacy server, “privacy_server.com.ca”, to the application server. The data layer stores the PII identifier and the e-mail domain in the local database. The data layer uses the PII identifier and the e-mail domain to create an e-mail address when sending an e-mail to the student. In the examples illustrated by FIGS. 3 and 4, the application is aware that it is receiving a PII identifier from the privacy server and the data layer is capable of decorating the PII identifier or assembling an e-mail address using the PII identifier.

In some instances, the user may provide information or data to the application that does not need to be protected, such as anonymous information. If so, the privacy server allows the anonymous information or data to pass unaltered to the application server. If PII is provided along with anonymous information that does not need privacy protection, then the privacy server only substitutes a token or PII identifier for the PII and allows the anonymous information to pass unaltered to the application server. For example, if a registration form requests a student's name in one information field and a class name in another information field, then the privacy server may replace the student's name with a token, but allow the class name to pass through to the application server.

When the privacy server receives a communication from the application server that includes a token, the privacy server uses the token to locate the PII that corresponds to the token stored in the PII database and substitutes the appropriate PII. For example, if the teacher requests a report for a class of students, then the teacher may provide a class identifier, such as a class name or course number to the application. The application generates a report that includes the tokens for the names of the students in the class. The privacy server intercepts the report and substitutes the students' names for the tokens prior to providing the report to the teacher.

In some implementations, the privacy server determines that a communication includes a token by scanning the communication for token delimiters, such as a start and/or end code. In other implementations, the privacy server may be designed to scan particular document types, particular documents or particular web pages for tokens.

An exemplary method for generating a token is illustrated by FIG. 5. The method starts when the privacy server receives a communication from the user's computer system in 502. The privacy server determines whether the communication includes PII in 504. For example, the privacy server may determine whether the communication includes a field associated with PII. If the communication does not include PII, then the method proceeds to 506 and the communication is forwarded without any modification. If the determination in 504 is that the communication includes PII, then the method proceeds to step 508 and the privacy server intercepts the communication. The privacy server extracts the PII in 510 and creates a PII identifier in 512. The privacy server associates the PII identifier with the PII in 514 and stores the PII identifier and the PII in a local database in 516. The privacy server creates a token by decorating the PII identifier in 517. Alternatively, the privacy server may use the PII identifier instead of a token. The privacy server substitutes the token for the PII in 518 and then forwards the communication with the token in 520 to the application server.

The privacy server may also serve as a proxy between a user and the application, as well as other applications or services since all communications from the user pass through the privacy server. This function is similar to that currently used to filter web traffic. When a user requests a web page, the web page request passes through the privacy server. If the web page request includes PII, then the privacy server replaces with PII with a token or PII identifier before the request is forwarded to the appropriate server.

Regardless of whether the web page request includes PII, the web page returned from the application server may include a token or PII identifier. The local database for the application server stores a token or a PII identifier instead of PII. If the web page includes an information field associated with PII, such as a name field, then the application server inserts the token or PII identifier into the field. The token or PII identifier can be retrieved from the local data base or the data layer can retrieve a PII identifier from the local data base and decorate it. When the privacy server receives the web page, it replaces the token or PII identifier with the appropriate PII prior to providing the web page to the user. In this manner the user receives a web page which includes the user's PII, even though the PII was never provided to the application server.

FIG. 6 illustrates an exemplary web page request. The user sends a communication to the application server requesting the web page. The communication does not includes any PII so the communication is forwarded to the application server without any modifications. The application server determines that the web page is to include the full name of a student known to the application as “12345”. The application server accesses the LMS database to retrieve the full name of student “12345”. In this case, the LMS database includes a token for the full name of the student “#@FN:12345@#”. The application server includes the token in the field of the web page associated with the full name of the student. When the privacy server receives the web page, the privacy server recognizes that the web page contains a token and replaces the token with the appropriate PII. In this case the privacy server replaces the token “#@FN:12345@#” with the student's full name “John Smith” prior to presenting the web page to the user.

FIG. 6 illustrates that the LMS database stores a token. In other instances, the LMS database may store the PII identifier and the data layer may decorate the PII identifier. For example, FIG. 7 illustrates the situation where the application server determines that the web page is to include the full name of a student known to the application as “12345”. The data layer checks the LMS database to determine whether the database includes information for student “12345”. If student “12345” exists in the LMS database, then the data layer decorates “12345” prior to sending the web page to the privacy server. In the example illustrated by FIG. 7, the data layer decorates “12345” with start and end codes, as well as an attribute code for the student's full name.

The privacy server may also act as an e-mail relay since it can substitute an e-mail address the uses a PII identifier and the privacy server's e-mail domain for the user's e-mail address or forward an e-mail to the user's e-mail address that was received at the privacy server. FIG. 8 illustrates an example where the application initiates communication with a student or other user by sending an e-mail. Since the student is only identified to the application by a PII identifier, the application looks up the e-mail address for the student using the PII identifier. The application determines that the e-mail address for the student identified by PII identifier “12345” is “12345@privacy_server.com.ca”. The application sends an e-mail to the student using that address. When the e-mail is received at the privacy server, the privacy server recognizes that the e-mail address includes the privacy server's e-mail domain. The privacy server uses the PII identifier to determine the corresponding e-mail address for the student and then substitutes the student's e-mail address for the received e-mail address. In the example illustrated by FIG. 8, the privacy server substitutes “john_smith@myschool.edu.ca” for “12345@privacy_server.com.ca” before sending the e-mail on to the student. The student receives an e-mail addressed to the student's e-mail address with content created by the application even though the application did not know the student's e-mail address.

FIG. 9 illustrates another example where the application initiates communication with a student or other user by sending an e-mail. Since the student is only identified by a PII identifier, the data layer looks up the e-mail address for the student using the PII identifier. The data layer determines that the student identified by PII identifier “12345” is associated with a privacy server that has an e-mail domain of privacy_server.com.ca. The data layer then addresses the e-mail to “12345@privacy_server.com.ca”. When the e-mail is received at the privacy server, the privacy server uses the PII identifier to determine the corresponding e-mail address for the student and then substitutes the student's e-mail address for the received e-mail address. In the example illustrated by FIG. 9, the privacy server substitutes “john_smith@myschool.edu.ca” for “12345@privacy_server.com.ca” before sending the e-mail on to the student. The student receives an e-mail addressed to the student's e-mail address, but created by the application even though the application did not know the student's e-mail address.

The foregoing description of exemplary embodiments of the invention has been presented only for the purposes of illustration and description and is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to explain the principles of the invention and their practical application to enable others skilled in the art to utilize the invention and various embodiments with various modifications as are suited to the particular use contemplated. Alternative embodiments will become apparent to those skilled in the art to which the present invention pertains without departing from its spirit and scope. For example, although the examples describe that the user is a teacher entering information about students, the user could be entering information about itself. In addition, the invention is not limited to a LMS or other educational application, but can be used with any system where privacy or protection of PII is a concern. 

What is claimed is:
 1. A method comprising: receiving a communication from a user's computer system, wherein the communication includes a plurality of information fields; determining whether the communication includes an information field directed to personally identifiable information (PII); if the communication does not include any information fields directed to PII, then forwarding the communication; if at least one of the information fields in the communication is directed to PII, then intercepting the communication; extracting information from the at least one information field; creating a PII identifier, wherein the PII identifier is a pseudo-random character string; associating the PII identifier with the extracted information; creating a token by decorating the PII identifier with at least one code; substituting the token for the information in the at least one information field directed to PII to create a second communication; and forwarding the second communication to an application server.
 2. The method of claim 1, wherein at least one of the information fields in the communication is directed to PII, at least another one of the information fields in the communication is directed to anonymous information, and the second communication includes the anonymous information.
 3. The method of claim 1, wherein associating the PII identifier with the PII, comprises storing the PII identifier and the PII in a local database.
 4. The method of claim 1, wherein the PII identifier is a unique identifier within a scope of a privacy server that receives the communication from the user's computer system.
 5. The method of claim 1, wherein determining whether the communication includes an information field directed to PII, comprises determining that the communication involves a predetermined form or a predetermined web page.
 6. The method of claim 1, further comprising: receiving a third communication from the application server; determining whether the third communication includes the token; if the third communication includes the token, then intercepting the third communication; determining that the token is associated with the extracted information; substituting the extracted information for the token to create a fourth communication; and forwarding the fourth communication to the user's computer system.
 7. The method of claim 1, wherein decorating the PII identifier with at least one code, comprises decorating the PII identifier with a start code and an end code.
 8. The method of claim 7, wherein decorating the PII identifier with at least one code, further comprises decorating the PII identifier with an attribute code that identifies a type of PII.
 9. A method comprising: receiving a communication from a user's computer system, wherein the communication includes a plurality of information fields; determining whether any of the information fields in the communication include personally identifiable information (PII); if at least one of the information fields in the communication includes an information field associated with PII, then intercepting the communication; extracting information from the information field associated with PII; creating a pseudo-random PII identifier; associating the PII identifier with the extracted information in a local database; substituting the PII identifier for the extracted information in the information field associated with PII to create a second communication; and forwarding the second communication to an application server.
 10. The method of claim 9, wherein the PII identifier is a unique identifier within a scope of a privacy server that receives the communication from the user's computer system.
 11. The method of claim 9, wherein the communication includes an information field associated with anonymous information, and wherein the second communication includes the anonymous information received from the user's computer system.
 12. The method of claim 9, wherein determining whether any of the information fields in the communication include personally identifiable information (PII) comprises determining that the communication is associated with a registration process.
 13. The method of claim 9, further comprising: receiving a third communication from the application server, wherein the third communication includes the PII identifier in a data field; intercepting the third communication; determining that the PII identifier is associated with the extracted information; substituting at least a portion of the extracted information for the PII identifier to create a fourth communication; and forwarding the fourth communication to the user's computer system.
 14. A privacy server, comprising: a computing device configured to: receive a communication from a user's computer system, wherein the communication includes a plurality of information fields; determine that the communication is associated with personally identifiable information (PII); intercept the communication; extract information from a first information field that is directed to PII; create a PII identifier; associate the PII identifier with the extracted information; substitute the PII identifier for the information in the first information field to create a second communication; and forward the second communication to an application server; and a storage device configured to locally store the PII identifier and the extracted information.
 15. The privacy server of claim 14, wherein the computing device is further configured to: determine that a second information field in the communication is directed to anonymous information; and include the anonymous information in the second information field of the second communication.
 16. The privacy server of claim 14, wherein the computing device is further configured to: receive a third communication from the application server; determine that the third communication includes the PII identifier in a data field; determine that the PII identifier is associated with the extracted information; substitute at least a portion of the extracted information for the PII identifier in the data field to create a fourth communication; and forward the fourth communication to the user's computer system.
 17. The privacy server of claim 14, wherein the computing device is configured to determine that the communication is associated with personally identifiable information (PII) by determining that the communication involves a predetermined form or predetermined web page.
 18. The privacy server of claim 14, wherein the computing device is configured to communicate the extracted information only between the privacy server and the user's computer system.
 19. The privacy server of claim 14, wherein the computing device is further configured to decorate the PII identifier to create a token and the storage device is configured to locally store the token.
 20. A method comprising: receiving a communication that includes a token, wherein the token comprises a personally identifiable information (PII) identifier decorated with at least one code; extracting the PII identifier from the communication; storing the PII identifier; creating a second communication by: determining that the second communication includes a data field directed to PII; retrieving the PII identifier; creating a second token by decorating the PII identifier with the at least one code; inserting the second token into the data field of the second communication; and sending the second communication to a privacy server. 